Dear Editor-in-Chief,


Wang et al. (2006) analyzed the spike gene sequences
of SARS coronavirus (SARS-CoV) from the recent sporadic
(December 2003 to January 2004) and 2002-2003 epidemic
human cases and of SARS-like-CoV from civet cats. The
authors claimed that SARS-CoV strain GD03T0013 from the
recent sporadic cases (WHO, 2004) was genetically closer
to the human SARS-CoV from the early phase of the
2002-2003 epidemic than to the wild animal SARS-like-CoV
from the 2002-2003 epidemic. Wang et al. (2006) therefore
concluded that the recent sporadic human SARS-CoV was
closer to an unknown SARS-CoV predecessor, which differs
markedly from the conclusions of previous studies
(Kan et al., 2005; Song et al., 2005; Wang et al., 2005).
A drawback of the study by Wang et al. (2006) is the
exclusion of a number of civet cat SARS-like-CoV
sequences, which left their analyses unable to fully
delineate the phylogenetic origin of strain GD03T0013.

To clarify the phylogenetic origin of strain GD03T0013,
we analyzed the full-length spike gene nucleotide
sequences (n = 60) of human SARS-CoV from both the recent
sporadic and 2002-2003 epidemic cases, as well as
SARS-like-CoV isolated from wild animals (civet cats,
raccoon dogs and bats). In particular, our dataset
included the SARS-like-CoV isolated from civet cats in
the 2003-2004 epidemic, which were not in the dataset of
Wang et al. (2006).
The sequences were aligned with ClustalX 1.83
(Thompson et al., 1994) and gap columns were removed,
generating an alignment of 3672 bp. Phylogenies were
reconstructed from the alignment using three methods:
neighbor-joining (NJ) implemented in PAUP* 4.0b
(Swofford, 2002), maximum likelihood (ML) implemented in
PhyML 2.4.4 (Guindon and Gascuel, 2003) and Bayesian
Markov Chain Monte Carlo (BMCMC) implemented in MrBayes
(Ronquist and Huelsenbeck, 2003). The substitution model
used was the best-fit model suggested by ModelTest 3.7
(Posada and Crandall, 1998). Five thousand bootstrap
replications were performed for both the NJ and ML
methods, whereas the BMCMC method used two sets of four
tempered MCMC chains run for 550,000 generations and
sampled every 100th generation, with the initial 10%
discarded as burn-in.
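
The gap-column removal step described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the procedure actually run for this letter: the function name `strip_gap_columns`, the toy sequences and the use of `-` as the gap character are all hypothetical choices made for the example.

```python
def strip_gap_columns(aligned, gap_chars="-"):
    """Return a copy of the alignment without any column that contains a gap.

    `aligned` maps a sequence name to its aligned sequence; all sequences
    must have the same length, as they would after multiple alignment.
    """
    if not aligned:
        return {}
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned sequences must have the same length")

    names = list(aligned)
    # Transpose the alignment so that each tuple is one column.
    columns = zip(*(aligned[name] for name in names))
    # Keep only the columns in which no sequence has a gap character.
    kept = [col for col in columns if not any(base in gap_chars for base in col)]
    # Transpose back to one string per sequence.
    return {name: "".join(col[i] for col in kept) for i, name in enumerate(names)}


# Toy alignment (hypothetical data, not the spike gene sequences).
toy = {
    "strain_a": "ATG-CCA",
    "strain_b": "ATGTC-A",
    "strain_c": "ATGTCCA",
}
stripped = strip_gap_columns(toy)
```

Removing whole columns, rather than masking individual gaps, keeps every sequence the same length, so that downstream distance and tree calculations compare the same sites in all strains.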

The topologies of the NJ, ML and BMCMC phylogenies are
essentially similar. Therefore, only the ML phylogeny is
presented here, and the confidence in its topology was
summarized from ML and NJ bootstrapping, as well as from
the sampled trees in the BMCMC chains. The summarized ML
phylogeny (Fig. 1) shows that strain GD03T0013 shares a
monophyletic relationship with the wild animal
SARS-like-CoV cluster B (isolated in the 2003-2004
epidemic) and this


0378-1135/$ — see front matter © 2007 Elsevier B.V. All rights reserved. 


doi: 10.1016/j.vetmic.2007.08.014 


Letter to the Editor/Veterinary Microbiology 126 (2008) 390-393 


[Fig. 1 tree image not reproduced. Recoverable labels: scale bar, 0.002 substitutions/site; support values on major clusters, (99/97/99), (72/81/87) and (99/98/97); cluster brackets, human epidemic cluster (2002-2003 epidemic), wild animal cluster A (2002-2003 epidemic) and wild animal cluster B (2003-2004 epidemic); GD03T0013 (AY525636) is marked with an asterisk.]


Fig. 1. Phylogenetic tree of spike gene nucleotide sequences from SARS-CoV and SARS-like-CoV isolated from humans, civet cats, raccoon
dogs and bats. Sequences from humans, civet cats, raccoon dogs and bats are indicated with the symbols (I), (O), (A) and (@), respectively. The
tree was reconstructed using the ML method, with confidence in the topology summarized from 5000 sampled trees from ML and NJ bootstrapping and
BMCMC chains. Only confidence values of major clusters are shown (ML/NJ/BMCMC, in parentheses). The recent human SARS-CoV
isolate GD03T0013 (from the index patient) is indicated with an asterisk. GZ0402 was isolated from the second patient (a waitress) in the
2003-2004 epidemic. Accession numbers of the sequences are shown in round brackets after their strain names (in bold). The distance unit is
substitutions/site. Rp3, isolated from bats, was used as an outgroup to root the tree, and the genetic distance of its branch is not shown.
Table 1
Comparison of pairwise nucleotide p-distances between the spike gene of strain GD03T0013 and other SARS-CoV and SARS-like-CoV isolated
from humans and wild animals

Cluster                              Isolates                                   p-distance
Human epidemic cluster (2002-2003)   GZ02, HSZ-Bc                               0.004630
Human epidemic cluster (2002-2003)   GZ60, GZ60, ZS-B, HSZ-Cc, JMD, HSZ2-A      0.004902
Wild animal cluster A (2002-2003)    SZ3                                        0.005719
Wild animal cluster A (2002-2003)    SZ13, SZ16                                 0.005991
Wild animal cluster B (2003-2004)    PC4-115, HC/SZ/266/03, GZ0401*             0.000272
Wild animal cluster B (2003-2004)    PC4-145                                    0.000545

Isolates sharing the shortest distances to GD03T0013 are shown for each cluster (human epidemic cluster, wild animal cluster A, wild animal
cluster B). The p-distances were calculated from the sequence alignment (length = 3672 bp) using MEGA 3.1 (Kumar et al., 2004).

* GZ0401 (AY568539) is the complete genome sequence of the SARS-CoV isolated from the same patient (the index patient in the 2003-2004
epidemic) from which the GD03T0013 (AY525636) spike gene sequence was obtained.


conclusion is supported by high confidence values.
Table 1 compares the pairwise genetic distances
(nucleotide p-distances) between the spike gene of strain
GD03T0013 and other strains in our dataset. The genetic
distance between strain GD03T0013 and civet cat
SARS-like-CoV strain PC4-115 (and HC/SZ/266/03) from wild
animal cluster B is smaller than the distance between
GD03T0013 and any strain in the human epidemic cluster or
in wild animal cluster A, both isolated during the
2002-2003 epidemic. Remarkably, only one nucleotide
difference was identified between the spike genes of
strains GD03T0013 and PC4-115 (and HC/SZ/266/03). In the
spike gene phylogeny (Fig. 1), PC4-115 (and HC/SZ/266/03)
could also be interpreted as the direct phylogenetic
predecessor of GD03T0013. Although the first patient
(GD03T0013) reported no contact with civet cats or other
animals in the 2 months preceding disease onset (Wang
et al., 2005), this phylogenetic and genetic evidence,
together with that presented in previous studies
(Kan et al., 2005; Song et al., 2005; Wang et al., 2005),
suggests that the recent human case was a sporadic
infection originating from the SARS-like-CoV of a
wild animal.
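
The p-distances in Table 1 can be checked with simple arithmetic: a p-distance is the number of differing sites divided by the number of sites compared. The Python sketch below is an illustration only, not the MEGA 3.1 calculation itself; the function names and the toy sequences are hypothetical. It recovers the number of nucleotide differences implied by each tabulated value, given the 3672 bp alignment length stated in the text.

```python
def p_distance(seq_a, seq_b):
    """Proportion of sites at which two aligned, gap-free sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    differences = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    return differences / len(seq_a)


ALIGNMENT_LENGTH = 3672  # bp, as stated for the gap-free spike gene alignment


def implied_differences(distance, length=ALIGNMENT_LENGTH):
    """Number of differing sites implied by a p-distance over `length` sites."""
    return round(distance * length)


# The six p-distance values reported in Table 1, smallest first.
table_values = [0.000272, 0.000545, 0.004630, 0.004902, 0.005719, 0.005991]
implied = [implied_differences(d) for d in table_values]

# Toy check of the definition itself (hypothetical sequences).
toy_distance = p_distance("ATGC", "ATGA")
```

Under these assumptions the smallest tabulated value, 0.000272, corresponds to a single difference in 3672 sites, which agrees with the one nucleotide difference reported between GD03T0013 and PC4-115 (and HC/SZ/266/03).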

Our results, based on genetic analyses of the spike gene,
show that the strains closest to the recent sporadic
human SARS-CoV are the civet cat SARS-like-CoV from the
2003-2004 epidemic, rather than the civet cat
SARS-like-CoV from the 2002-2003 epidemic suggested by
Zhao et al. (2004), or the human SARS-CoV from the
earlier phase of the 2002-2003 epidemic, or an unknown
predecessor suggested by Wang et al. (2006). The major
difference between the analysis of Wang et al. (2006) and
ours is the sample size and diversity of the sequences
used. They used fewer isolates and, more importantly, did
not include the civet cat SARS-like-CoV sequences
isolated in the 2003-2004 epidemic (shown in Fig. 1 under
the wild animal cluster B grouping). Consequently, they
could not fully delineate the phylogenetic origin of the
SARS-CoV in the recent sporadic human cases. We want to
emphasize the importance of sample size and diversity in
phylogenetic analysis, especially in any search for the
possible origins of a disease-causing agent.